F0 declination in English and Mandarin broadcast news speech
Abstract
This study investigates F0 declination in broadcast news speech in English and Mandarin Chinese. The results demonstrate a strong relationship between utterance length and declination slope: shorter utterances have steeper declination, even after the initial rising and final lowering effects are excluded. Both the topline and the baseline show declination, but the two are independent of each other. The topline and baseline follow different patterns in Mandarin Chinese, whereas in English their patterns are similar. Compared with English, Mandarin Chinese shows more and steeper declination, as well as a wider pitch range and more F0 fluctuations.
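The abstract does not say how the declination slopes were measured. As an illustration only, one common way to quantify topline and baseline declination is to fit separate straight lines through the F0 peaks and the F0 valleys of an utterance and read off the two slopes in Hz per second. The sketch below does this on a synthetic contour. Every name in it (`fit_line`, `local_extrema`, `declination_slopes`, `synthetic_contour`) and every parameter value is hypothetical. The synthetic data assume the same total F0 drop for every utterance length, so the steeper slope for the shorter utterance is built into the example and is not evidence for the study's finding. The sketch also makes no attempt to exclude initial rising or final lowering.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two points to fit a line")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def local_extrema(values):
    """Return indices of interior local maxima (peaks) and minima (valleys)."""
    peaks, valleys = [], []
    for i in range(1, len(values) - 1):
        if values[i - 1] < values[i] >= values[i + 1]:
            peaks.append(i)
        elif values[i - 1] > values[i] <= values[i + 1]:
            valleys.append(i)
    return peaks, valleys


def declination_slopes(times, f0):
    """Return (topline_slope, baseline_slope) in Hz per second."""
    peaks, valleys = local_extrema(f0)
    topline, _ = fit_line([times[i] for i in peaks], [f0[i] for i in peaks])
    baseline, _ = fit_line([times[i] for i in valleys], [f0[i] for i in valleys])
    return topline, baseline


def synthetic_contour(duration_s, start_hz=220.0, drop_hz=30.0, step_s=0.01):
    """Hypothetical contour: a linear fall of drop_hz over the whole
    utterance plus a 4 Hz ripple standing in for accent-related movement."""
    n = int(round(duration_s / step_s)) + 1
    times = [i * step_s for i in range(n)]
    f0 = [
        start_hz - drop_hz * t / duration_s
        + 15.0 * math.sin(2 * math.pi * 4.0 * t)
        for t in times
    ]
    return times, f0


if __name__ == "__main__":
    for duration in (1.5, 4.0):
        top, base = declination_slopes(*synthetic_contour(duration))
        print(f"{duration:.1f} s utterance: topline {top:.2f} Hz/s, "
              f"baseline {base:.2f} Hz/s")
```

On real speech the peaks and valleys would come from a pitch tracker, and unvoiced frames and octave errors would need handling before any line is fitted. Fitting the two lines separately is what allows the topline and baseline slopes to differ, which is the kind of comparison the abstract describes.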
Related resources
Improved tone modeling for Mandarin broadcast news speech recognition
Tone has a crucial role in Mandarin speech in distinguishing ambiguous words. Most state-of-the-art Mandarin automatic speech recognition systems adopt embedded tone modeling, where tonal acoustic units are used and F0 features are appended to the spectral feature vector. In this paper, we combine the embedded approach (using improved F0 smoothing) with explicit tone modeling in rescoring the ou...
Unsupervised Learning of Tone and Pitch Accent
Recognition of tone and intonation is essential for speech recognition and language understanding. However, most approaches to this recognition task have relied upon extensive collections of manually tagged data obtained at substantial time and financial cost. In this paper, we explore unsupervised clustering approaches to recognize pitch accent in English and tones in Mandarin Chinese. In unsu...
Unsupervised and Semi-supervised Learning of Tone and Pitch Accent
Recognition of tone and intonation is essential for speech recognition and language understanding. However, most approaches to this recognition task have relied upon extensive collections of manually tagged data obtained at substantial time and financial cost. In this paper, we explore two approaches to tone learning with substantial reductions in training data. We employ both unsupervised cl...
MATBN 2002: A Mandarin Chinese Broadcast News Corpus
The MATBN 2002 Mandarin Chinese broadcast news corpus contains a total of 40 hours of broadcast news from Public Television Service Foundation (Taiwan) with corresponding transcripts. The primary motivation for this collection is to provide training and testing data for continuous speech recognition evaluation in the broadcast domain. We expect to collect and process 220 hours of Mandarin Chine...
Fundamental frequency in English and Mandarin: Production and perception
In a series of production experiments, the speaking F0 profiles (e.g. F0 range, mean, standard deviation) of English and Mandarin were compared for different kinds of speech samples: tone sweeps, isolated words, neutral prose passages, and lively reading of story character dialog. Whether the two languages differed depended on the particular speech samples being compared. Most notably, the phys...